Markov decision process - PDFSEARCH.IO - Document Search Engine

Markov decision process
Results: 537

#	Item
101	Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu Add to Reading List Source URL: www.cs.toronto.edu Language: English - Date: 2013-12-19 11:19:32 Computational neuroscience Cybernetics Reinforcement learning Q-learning Temporal difference learning SARSA Markov decision process Unsupervised learning Recurrent neural network Machine learning Neural networks Statistics
102	A subexponential lower bound for the Least Recently Considered rule for solving linear programs and games Oliver Friedmann Department of Computer Science, University of Munich, Germany Add to Reading List Source URL: files.oliverfriedmann.de Language: English - Date: 2012-03-20 09:22:12 Stochastic control Statistics Dynamic programming Markov decision process
103	This paper was presented as part of the main technical program at IEEE INFOCOMA Nearly-Optimal Index Rule for Scheduling of Users with Abandonment Urtzi Ayesta∗† , Peter Jacko∗ and Vladimir Novak∗‡ ∗ B Add to Reading List Source URL: homepages.laas.fr Language: English - Date: 2012-01-01 05:04:19 Systems theory Dynamic programming Equations Markov processes Decision theory Loss function Bellman equation Asymptotically optimal algorithm Markov decision process Statistics Control theory Mathematical optimization
104	Subexponential lower bounds for randomized pivoting rules for solving linear programs Oliver Friedmann ∗ Add to Reading List Source URL: files.oliverfriedmann.de Language: English - Date: 2012-02-10 07:43:14 Operations research Linear programming Simplex algorithm Markov decision process LP-type problem Reinforcement learning Algorithm Simplex SL Mathematics Applied mathematics Geometry
105	Performance Evaluation Performance Evaluation–22 A Modeling Framework for Optimizing the Flow-Level Scheduling with Time-Varying Channels Add to Reading List Source URL: homepages.laas.fr Language: English - Date: 2012-01-01 05:03:03 Markov chain Scheduling Gittins index Dynamic programming Markov decision process Statistics Markov processes Operations research
106	Department of Mathematics Texas A&M University 3368 TAMU College Station, TXF Add to Reading List Source URL: see-math.math.tamu.edu Language: English - Date: 2015-02-14 23:22:02 Computational number theory Cryptography Factorization of polynomials over a finite field and irreducibility tests Polynomials Markov decision process Mathematics Coding theory Algebra
107	A DISSERTATION IN ARTIFICIAL INTELLIGENCE Evolutionary Dynamics of Reinforcement Learning Algorithms in Strategic Interactions Add to Reading List Source URL: michaelkaisers.com Language: English - Date: 2012-12-03 03:18:47 Machine learning Q-learning Reinforcement Markov decision process Statistics Learning Reinforcement learning
108	Submodular Surrogates for Value of Information Yuxin Chen Shervin Javdani Amin Karbasi Add to Reading List Source URL: www.ri.cmu.edu Language: English - Date: 2015-01-12 10:51:27 Design of experiments Mathematical optimization Systems engineering Submodular set function Statistical hypothesis testing Optimal decision Markov decision process Optimal design Optimal control Statistics Operations research Decision theory
109	A subexponential lower bound for Zadeh’s pivoting rule for solving linear programs and games Oliver Friedmann Department of Computer Science, University of Munich, Add to Reading List Source URL: files.oliverfriedmann.de Language: English - Date: 2012-02-10 07:43:23 Mathematical optimization Dynamic programming Markov decision process Stochastic control Reinforcement learning Linear programming Simplex algorithm Statistics Operations research Mathematics
110	Balancing Anarchy and Central Control Individual vs. Joint Action Reinforcement Learning Daniel Claes June 18, 2010 Abstract Add to Reading List Source URL: michaelkaisers.com Language: English - Date: 2012-04-29 08:03:28 Artificial intelligence Cognitive science Machine learning Dynamic programming Markov processes Q-learning Reinforcement learning Markov decision process Temporal difference learning Statistics Science Learning